Yefei He

ReCA: Multi-Shot Long Video Extrapolation via Recursive Context Allocation

May 26, 2026
Via arXiv

MSAVBench: Towards Comprehensive and Reliable Evaluation of Multi-Shot Audio-Video Generation

May 19, 2026
Via arXiv

Dynamic Execution Commitment of Vision-Language-Action Models

May 12, 2026
Via arXiv

FlashAR: Efficient Post-Training Acceleration for Autoregressive Image Generation

May 10, 2026
Via arXiv

World-R1: Reinforcing 3D Constraints for Text-to-Video Generation

Apr 27, 2026
Via arXiv

Less Detail, Better Answers: Degradation-Driven Prompting for VQA

Apr 07, 2026
Via arXiv

ShowTable: Unlocking Creative Table Visualization with Collaborative Reflection and Refinement

Dec 15, 2025
Via arXiv

Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance

Dec 09, 2025
Via arXiv

OmniSparse: Training-Aware Fine-Grained Sparse Attention for Long-Video MLLMs

Nov 18, 2025
Via arXiv

ZipR1: Reinforcing Token Sparsity in MLLMs

Apr 23, 2025
Via arXiv